1. Agent's initial state
2. Agent's state after 1,000 steps
3. Agent's state after 5 million steps
4. Agent's state after 400 million steps
5. Agent's final state (after 600 million steps)